skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Nguyen, Thuan"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract We consider estimation of the mean squared prediction error (MSPE) for observed best prediction (OBP) in small area estimation with count data. The OBP method has been previously developed in this context by Chen et al. (Journal of Survey Statistics and Methodology, 3, 136–161, 2015). However, estimation of the MSPE remains a challenging problem due to potential model misspecification that is considered in this setting. The latter authors proposed a bootstrap method for estimating the MSPE, whose theoretical justification is not clear. We propose to use a Prasad–Rao‐type linearization method to estimate the MSPE. Unlike the traditional linearization approaches, our method is computationally oriented and easier to implement in the same regard. Theoretical properties and empirical performance of the proposed method are studied. A real‐data application is considered. 
    more » « less
    Free, publicly-accessible full text available December 1, 2025
  2. null (Ed.)
    Cancer incidence and mortality are typically presented as age-standardized rates. Inference about these rates becomes complicated when denominators involve sampling errors. We propose a bias-corrected rate estimator as well as its corresponding variance estimator that take into account sampling errors in the denominators. Confidence intervals are derived based on the proposed estimators as well. Performance of the proposed methods is evaluated empirically based on simulation studies. More importantly, advantage of the proposed method is demonstrated and verified in a real-life study of cancer mortality disparity. A web-based, user-friendly computational tool is also being developed at the National Cancer Institute to accompany the new methods with the first application being calculating cancer mortality rates by US-born and foreign-born status. Finally, promise of proposed estimators to account for errors introduced by differential privacy procedures to the 2020 decennial census products is discussed. 
    more » « less
  3. null (Ed.)
    In recent years, the need to exploit digitized document data has been increasing. In this paper, we address the problem of parsing digitized Vietnamese paper documents. The digitized Vietnamese documents are mainly in the form of scanned images with diverse layouts and special characters introducing many challenges. To this end, we first collect the UIT-DODV dataset, a novel Vietnamese document image dataset that includes scientific papers in Vietnamese derived from different scientific conferences. We compile both images that were converted from PDF and scanned by a smartphone in addition a physical scanner that poses many new challenges. Additionally, we further leverage the state-of-the-art object detector along with the fused loss function to efficiently parse the Vietnamese paper documents. Extensive experiments conducted on the UIT-DODV dataset provide a comprehensive evaluation and insightful analysis. 
    more » « less